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PATENTS 

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

In re application of 

Alan Bradley JONES et al. 

Serial No. (unknown) 

Filed herewith 

APPARATUS AND SYSTEM FOR 
CLASSIFYING AND CONTROL 
ACCESS TO INFORMATION 

PRELIMINARY AMENDMENT 

Commissioner for Patents 

Washington, D.C. 20231 

Sir : 

Prior to calculation of the filing fee, please 
amend the above-identified application as follows: 

IN THE CLAIMS : 

7\mend claim 7 as follows: 

-~7. (Amended) The invention according to claim 
1 further comprising means for storing said one or more 
transmission characteristics . 

Amend claim 8 as follows: 

--8. (Amended) The invention according to claim 
1 wherein said one or more transmission characteristics 
include any one or more of network protocol, date and time 
stamps, size of transmission activities (text and image) , 
content type of transmission activities, pattern seen within 



Alan Bradley JONES et al. 
the content of the transmission and any other characteristic 
that can be employed for predicting classifications. 
Amend claim 9 as follows: 

--9. (7\mended) The invention according to claim 
1 wherein said one or more transmission characteristics are 
obtained from network packets or fragments thereof. 
%;i Amend claim 10 as follows: 

--10. (Amended) The invention according to claim 
1 wherein the analysing means includes profiling means for 
providing profiles of interactions based on said one or more 



m transmission characteristics . 

\J Amend claim 12 as follows: 

f1 ■ 

|4 --12. (Amended) The invention according to claim 

1 further comprising a knowledge base of predetermined 
profiles^ and the analysing means is adapted to predict a 
classification based on a comparison between the profile of 
information to be classified and predetermined profiles. -- 
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Alan Bradley JONES et al. 
REMARKS 

The above changes in the claims merely place this 
national phase application in the same condition as it was 
during the international phase, with the multiple dependen- 
cies being removed. 

Attached hereto is a marked-up version of the 
changes made to the claims by the current amendment. The 
attached page is captioned ^'VERSION WITH MARKINGS TO SHOW 
CHANGES MADE''. 

Respectfully submitted, 
YOUNG & THOMPSON 




Benolt Castel 
Attorney for Applicants 
Registration No. 35,041 
Customer No. 00466 
745 South 23^^ Street 
Arlington, VA 22202 
Telephone: 7 03/521-22 97 

September 4, 2001 
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VERSION WITH MARKINGS TO SHOW CHANGES MADE 

The claims have been amended as follows: 

7 . (Amended) The invention according to -arry — ai:=re 
of — claim-s 1 to — 6 further comprising means for storing said 
one or more transmission characteristics. 

8 . (Amended) The invention according to ^any — orte 
of — claim-s- 1 — — 7- wherein said one or more transmission 
characteristics include any one or more of network protocol^ 
date and time stamps, size of transmission activities (text 
and image) , content type of transmission activities, pattern 
seen within the content of the transmission and any other 
characteristic that can be employed for predicting classifi- 
cations . 

9. (Amended) The invention according to ^rrry — crrte 
■erf — claimis- 1 — to — 8- wherein said one or more transmission 
characteristics are obtained from network packets or 
fragments thereof . 

10. (Amended) The invention according to amy — orre 
of — claims 1 — to — 9 wherein the analysing means includes 
profiling means for providing profiles of interactions based 
on said one or more transmission characteristics. 

12. (Amended) The invention according to any one 
of claim-s — 1 — to — Hr 1 further comprising a knowledge base of 
predetermined profiles, and the analysing means is adapted 
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to predict a classification based on a comparison between 
the profile of information to be classified and predeter- 
mined profiles. 
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ABSTRACT OF THE DISCLOSURE 

An apparatus is provided for classifying 
information or content servers on a communications 
network including the Internet. The apparatus comprises 
element for obtaining one or more transmission character- 
istics of information on a path of the communications 
network and analysing element for predicting a classifi- 
cation of the information based on the one or more 
transmission characteristics. Typically the one or more 
transmission characteristics include any one or more of 
network protocol, data and time stamps, size of transmis- 
sion activities (text and image) , content type of 
transmission activities, pattern seen within the content 
of the transmission and any other characteristic that can 
be employed for predicting classifications. The appara- 
tus can be adapted to classify user profiles in accor- 
dance with the predicted classification. A knowledge 
base of predetermined profiles can be included, and the 
analysing element is adapted to predict a classification 
based on a comparison between the profile of information 
to be classified and the predetermined profiles. 
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APPARATUS AND SYSTEM FOR CLASSIFYING AND 



CONTROL ACCESS TO INFORMATION 



TECHNICAL FIELD OF THE INVENTION 



THIS INVENTION relates to apparatus and system for classifying infornnation 
on communications network and in particular but not limited to apparatus and 
system for classifying content servers and for selectively controlling access to 
classified content servers. 



The phenomenon growth of information technology has allowed many 
people to have access to diverse information on communications networks. The 
Internet in particular allows fetching of information from any cooperating 
computers or content servers located in different parts of the world by simply 
clicking references to the information. As the number of accessible computers or 
content servers and the amount of information over the communications network 
grow daily it becomes increasingly difficult to classify them manually. 

Known systems for controlling the types of information accessible on a 
network rely on comparing a requested destination with those on pre-determined 
Access Control Lists (ACL) or on word matching to determine whether to allow or 
deny access. This approach can be applied at the cl ient node prior to requesting the 
information or on any suitably intelligent network device capable of intercepting 
the request or subsequent reply prior to it reaching the requester. For example, in 
the case of an Internet browser running on a PC or work station, a request is made 
for an Internet resource such as a web site. A software program for monitoring such 
requests on the PC can be configured to scan a pre-determined list of site addresses 
for a match. If found, access to the site may be denied and a suitable message is 
then displayed informing the user that access is denied. Alternatively, the request 
may be allowed to proceed, but as data are received from the site they are scanned 
for checking a match with one or more sets of pre-determined words, word 
fragments or phrases. If a match is found the site is not displayed on the computer 
but instead there is shown a suitable message. Typically, this type of control 
software is installed on a PC or work station which does not have particulariy strict 
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access privileges. The control software can be easily removed, disabled or 
otherwise circumvented and thereby defeating the control system. 

A network device capable of intercepting the request or reply to a request, 
such as a proxy server, may perform similar actions using the same methods of web 
site matching. This is usually maintained by a network administrator with strict 
access rights* Also, a network requiring clients to connect through the network 
device in order to access the network can have its content control enforced. This 
allows content control of multiple clients from one central point. 

While these known systems do provide some access control abilities, there 
are several disadvantages. A system based on word or phrase matching can only 
match text and it therefore would allow access to undesired information comprising 
graphic images. Also, a single word may match a broad range of sites with quite 
different classes of information. As an example, when the word "sex" is used to 
match pornographic sites the system would also block access to other sites 
providing non offensive information such as articles on biology. 

A system based on an access control list of prohibited sites is much more 
selective. Access can only be denied when attempting to access the sites which are 
included in the lists. While a suitably large list could bar access to a great deal of 
undesirable information it is difficult to keep up to date due to the rapid increase 
in the number of new sites and removal of sites. 

The above systems also do not lend themselves to adaptation to other 
network protocols and services such as interactive chat, streaming video, email or 
encrypted data streams. Extending to different languages also poses a problem for 
globalisation of these systems. 

QBIECT OF THE INVENTION 

An object of the present invention is to alleviate or to reduce to a certain 
degree one or more of the above disadvantages. 

Another object of the present invention is provide an apparatus/system for 
classifying user profiles. 
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SUMMARY OF THE INVENTION 
In one aspect therefor the present invention resides in an apparatus for 
classifying information on connnnunications network. The apparatus comprises 
means for obtaining one or more transmission characteristics of information on a 
5 path of said communications network, and analysing means for predicting a 
classification of said information based on said one or more transmission 
characteristics. 

In a second aspect therefor the present invention resides in an apparatus for 
classifying content servers which are accessible on a communications network. The 

1 6 apparatus comprises means for obtaining one or more transmission characteristics 
of information provided by any of said content servers on a path of said 
communications network, and analysing means for predicting a classification of said 
information based on said one or more transmission characteristics. 

In a third aspect therefor the present invention resides in a computer 

15 program for classifying information which is accessible on a communications 
network. The program comprises means for obtaining one or more transmission 
characteristics of information on a path of said communications network, and 
analysing means for predicting a classification of said information based on said one 
or more transmission characteristics. 

20 In a fourth aspect therefor the present invention resides in a computer 

program for classifying content servers which are accessible on a communications 
network. The apparatus comprises means for obtaining one or more transmission 
characteristics of information provided by any of said content servers on a path of 
said communications network, analysing means for predicting a classification of 

25 said information based on said one or more transmission characteristics. 

in a fifth aspect therefor the present invention resides in an 
apparatus/computer program for classifying user profiles of users accessing 
information or content servers on a communications network. The 
apparatus/computer program comprises means for obtaining one or more 

30 transmission characteristics of information or information provided by any one of 
said content servers on a path of said communications network, analysing means 
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for predicting a classification of said information or said one content server based 
on said one or more transmission characteristics, and means for classifying user 
profile in accordance with the predicted classification. 

The above invention may also comprise means for storing said one or more 
transmission characteristics. 

Typically said one or more transmission characteristics include any one or 
more of network protocol, date and time stamps, size of transmission activities (text 
and image), content type of transmission activities, pattern seen within the content 
of the transmission and any other characteristic that can be employed for predicting 
classifications. 

In preference said one or more transmission characteristics are obtained from 
network packets or fragments thereof. 

It is also preferred that the analysing means includes profiling means for 
providing profiles of interactions based on said one or more transmission 
characteristics. Typically said profiling means is arranged to process said one or 
more transmission characteristics for providing any one or more of frequency of 
interaction, duration of interaction, duration of absence of interaction, patterns of 
transmission, average number of http links within an object of related sites, average 
number of like sites visited within a time frame, and statistics from said other 
characteristics, for forming interaction profiles. The analysing means can then use 
the profiles for predicting classifications. 

The invention may have a knowledge base of predetermined profiles, and 
the analysing means is adapted to predict a classification based on a comparison 
between the profile of information to be classified and predetermined profiles. 

Advantageously the invention may have means for updating the knowledge 
base so that the classification prediction may be enhanced following classifications. 

In order that the present invention can be more readily understood and be 
put into practical effect reference will now be made to the accompanying drawings 
which illustrate one preferred embodiment of the invention and wherein: 
BRIEF DESCRIPTION OF THE DRAWING 

Figure 1 is a schematic diagram of the apparatus according to the invention; 
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Figure 2 is a table of selected data of captured packets of a search engine 
using the apparatus shown in Figure 1; 

Figure 3 is a partial table of selected data of captured packets of a news web 
site using the apparatus shown in Figure 1; 

Figure 4 is a table of selected data of captured packets of an entertainment 
web site using the apparatus shown in Figure 1; 

Figure 5 is a table of selected data of captured packets of the web site of an 
e-commerce merchant using the apparatus shown in Figure 1; 

Figure 5 is a table of selected data of captured packets of the web site of 
another e-commerce merchant using the apparatus shown in Figure 1; 

Figure 7 is a table of selected data of captured packets of a pornography web 
site using the apparatus shown in Figure 1; 

Figure 8 is a table of selected data of captured packets of another 
pornography web site using the apparatus shown in Figure 1; 

Figure 9 is a table of model N1 results using the apparatus shown in Figure 

1; 

Figure 10 ts a table of model N2 results using the apparatus shown in Figure 

1; 

Figure 1 1 is a table of model N3 results using the apparatus shown in Figure 

1; and 

Figure 12 is a table of classification prediction confidence levels using the 
apparatus shown in Figure 1 . 

DESCRIPTION OF THE PREFERRED EMBODIMENT 

Referring initially to Figure 1 there is shown an apparatus 10 for classifying 
media or information flowing through a path of a communications network which 
in this case is the Internet. 

As can be seen, network traffic passing through the apparatus 1 0 is captured 
and analysed for providing statistics relating to interactions between two or more 
terminals (not shown). The captured traffic is first checked against a list of 
predetermined classifications to determine if it is known or unknown. 
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When the captured traffic is of an unknown classification, various models 
(to be described more fully below) are applied to the data set in the captured traffic 
in order to predict the content classification. The models use parameters derived 
from a knowledge base of previously classified data sets and fitness with these 
5 parameters to determine the classification of the content of the newly captured 
traffic. Thus, the web site sending the captured traffic is now classified and is added 
to the list of known classifications, 
y It should be noted that the embodiment of the apparatus 10 as described 

herein is for an analysis of transmission traffic using the HTTP protocol. The 
II 10 apparatus 10 according to the present invention is not restricted to HTTP, and 

is easily adaptable to analyse data carried within any networks using any known 
W protocol. Examples of the protocols include FTP, SMTP, NNTP, etc. 

h^^ Following classification the captured data set is stored in the knowledge 

base. As the knowledge base expands, more data are used for the model 
15 parameters. This refines the apparatus and results in improved predictive 
y performance. 

The sites that are deemed to include undesirable information are added to 
Access control lists (ACLs). The ACLs are used control the flow of content 
information between terminals. E.g. Undesired content information can be 
20 prevented from travelling further through the network by simply not forwarding it, 
or by replacing it, or by intercepting the request for such content information and 
modifying its destination. 

Classification of traffic from content servers are relatively static. On the other 
hand, user terminals that interact with these content servers are variable and their 
25 classifications are considered transient classifications. 

Whereas classifications of content servers form a model of the style of 
content residing on the server, transient classifications form a model of style of 
content being viewed by a user terminal, or content consumer. This in effect forms 
a behaviour profile of such a consumer. This profile can be used to tailor the 
30 content information to suit the consumer. 
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As mentioned earlier the apparatus 10 captures a set of observed data 
relating to a network interaction event, and provides a set of results indicating the 
classification of a resource or personality residing at each network node involved 
in the interaction. This is accomplished by applying various statistical models to a 
profile, and testing this against results obtained from profiles of known 
classifications. In this example of the invention this process is represented by the 
following formulas: 

X is an unknown profile to be classified; 

Profiles p1,p2,p3.,.pn are of known classifications; 

Models Ml,M2,M3...Mn are available to operate on these profiles; and 

C1,C2,C3...Cn are profile classifications. 

The population of a profile of classification CI, may be defined by the 
population of Ml (p). Ml (x) may be tested against the true population using any of 
the standard statistical hypothesis methods. 

A pre-determined set of media terminals of a classification are modelled by 
various models Ml, M2 .. Mn. Each model consists of an approach and a set of 
parameter, e.g linear regression, gradient and point of interception, so that for a 
single classification M1(p1,p2 pn), M2(q1,q2 qn) „ Mn(r1,r2 .. rn) are used to 
model the population from the classification. The models may be based on 
mathematical structures, or arbitrary rules. 

The models are continually refined as more network traffic passes through 
the apparatus 10, thereby increasing the population space from which the 
classifications are computed. 

A terminal may be permanently or transitionally defined in relation to a 
classification. A transitionally defined terminal may move between classifications 
based on the fitness of the observed traffic to the models of the various 
classifications. 

Figures 2 to 8 are tables of selected data of traffic for testing the profile of 
data during a network interaction with a content server to determine if it contains 
media content of a pornographic nature. Assumption is made that profiles for 
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content servers contain a variable which is the average size of graphical innages 
served. 

A nornnai distribution or similar non-deternninistic probability distribution is 
then used to test the hypothesis that the profile belongs to a population classified 
as pornographic. In this example, the population of the classification may be 
defined by the population of N(a,b) where N is the image size and a and b are the 
mean and variance respectively, based on a normal distribution. The average and 
standard deviation derived from the observed samples is tested against the true 
population using standard statistical hypothesis methods. 

In some cases this approach may be broadened to encompass analysis of 
variance methods with multiple dependant variables, to model the characteristics 
of a site. Traditional ANOVA or regressive techniques may be applied to model the 
media content, 

A variety of traditional deterministic and non-deterministic models may be 
applied to determine the hypothesis of profile classification. These may be changed 
or upgraded continually depending on the level of predictive power found. The 
functionality of models used is not limited to, but can include simple ru!es-of- 
thumb, deterministic and non-deterministic probability models, or arbitrary 
calculations. 

The choice of mode! is primarily dictated by the predictive power of that 
model against the population in question. 

Figures 2 through 8 show examples of basic data set that can be gathered by 
obsen/ing network traffic of a typical interaction between a client browser and a 
web server. 

Figures 9 to 1 1 illustrate a simple classification model. This model looks at 
the size, content and relationships of objects being transmitted by a content server. 
The outcome of this model is to determine if the media being transmitted has 
pornographic content. 
Classification: pornographic 
Standard Model: 
Nl(a,b) 
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Where N1 is the image size, a and b are the mean and variance respectively, based 

on a normal distribution. 

N2{c,d) 

Where N2 is the ratio of text to graphics, c and d are the total size of the text and 

graphic objects respectively. 

N3(e) 

Where N3 is the count of word patterns matched from a list of pre-determined 
words, and e is the text of an object. 

Observed Samples are given in the tables shown in Figures 2 to 8. 

For model Nl shown in Figure 9, there is applied the normal distribution 
hypothesis test to the observed samples deriving the results. 

The result shows confidence to the 93% and S7% level for sites 6 and 7 
respectively, that the sites belong to a population of pornographic sites. The other 
samples give much lower confidence levels. 

For model N2 shown in Figure 1 0, a simple rule is used to test if the ratio is 
below a pre-determined threshold. The results show that sites 2, 4, 5 and 7 are 
within the threshold rating. 

For Model N3 shown in Figure 11, a simple rule is used to test if the 
number of words matching a list of patterns, exceeds a pre-determined threshold. 
The results show that sites 5 and 7 exceed the threshold. 

A weighting formula is then applied to derive a final result as shown in 
Figure 12. 

Therefore, using this example model, the apparatus 10 would predict that 
sites 6 and 7 are probably serving media with pornographic content, whereas sites 
1 through 5 probably are not. 

The attached appendix shows an example of the set of rules, constants and 
formulas which determine a confidence prediction based on logistic regression. The 
rules are defined using "Submodel" and "Model" components to define individial 
data points, and aggregated data points. These are then referred to in the 
"ProbabilityAnalyser" equations which use standard predictive formulas. 
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Whilst the above has been given by way of illustrative example of the 
present invention many variations and modifications thereto will be apparent to 
those skilled in the art without departing from the broad ambit and scope of the 
invention as herein set forth. 
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CLAIMS 

1. An apparatus for classifying information on connmunications network, the 
apparatus comprises means for obtaining one or more transmission characteristics 
of information on a path of said communications network, and analysing means for 
predicting a classification of said information based on said one or more 
transmission characteristics. 

2. An apparatus for classifying content servers which are accessible on a 
communications network, the apparatus comprises means for obtaining one or 
more transmission characteristics of information provided by any of said content 
servers on a path of said communications network, and analysing means for 
predicting a classification of said information based on said one or more 
transmission characteristics. 

3. A computer program for classifying information which is accessible on a 
communications network, the program comprises means for obtaining one or more 
transmission characteristics of information on a path of said communications 
network, and analysing means for predicting a classification of said information 

based on said one or more transmission characteristics. 

4. A computer program for classifying content servers which are accessible on 
a communications network, the apparatus comprises means for obtaining one or 
more transmission characteristics of information provided by any of said content 
servers on a path of said communications network, analysing means for predicting 
a classification of said information based on said one or more transmission 
characteristics. 

5. An apparatus for classifying user profiles of users accessing information or 
content servers on a communications network, the apparatus comprises means for 
obtaining one or more transmission characteristics of information or information 
provided by any one of said content servers on a path of said communications 
network, analysing means for predicting a classification of said information or said 
one content server based on said one or more transmission characteristics, and 
means for classifying user profile in accordance with the predicted classification. 
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6. A computer program for classifying user profiles of users accessing 
information or content servers on a communications network, the program 
comprises means for obtaining one or more transmission characteristics of 
information or information provided by any one of said content servers on a path 
of said communications network, analysing means for predicting a classification of 
said information or said one content server based on said one or more transmission 
characteristics, and means for classifying user profile in accordance with the 
predicted classification. 

7. The invention according to any one of claims 1 to 5 further comprising 
means for storing said one or more transmission characteristics. 

8. The invention according to any one of claims 1 to 7 wherein said one or 
more transmission characteristics include any one or more of network protocol, 
date and time stamps, size of transmission activities (text and image), content type 
of transmission activities, pattern seen within the content of the transmission and 
any other characteristic that can be employed for predicting classifications. 

9. The invention according to any one of claims 1 to 8 wherein said one or 
more transmission characteristics are obtained from network packets or fragments 
thereof. 

10. The invention according to any one of claims 1 to 9 wherein the analysing 
means includes profiling means for providing profiles of interactions based on said 
one or more transmission characteristics. 

11. The invention according to claim 10 said profiling means is arranged to 
process said one or more transmission characteristics for providing any one or more 
of frequency of interaction, duration of interaction, duration of absence of 
interaction, patterns of transmission, average number of http links within an object 
of related sites, average number of like sites visited within a time frame, and 
statistics from said other characteristics, for forming interaction profiles, and the 
analysing means is adapted to use the profiles for predicting classifications. 

12. The invention according to any one of claims 1 to 11 further comprising a 
knowledge base of predetermined profiles, and the analysing means is adapted to 
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predict a classification based on a comparison between the profile of information 
to be classified and predetermined profiles. 

1 3. The invention according to claim 1 2 further comprising means for updating 
the knowledge base so that the classification prediction can be enhanced following 
classifications. 
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(57) Abstract 

An apparatus (10) is provided for classifying information or content servers on a conamunications network including the Internet. The 
apparatus (10) comprises means for obtaining one or more transmission characteristics of information on a path of said communications 
network and analysing means for predicting a classification of said infomiation based on said one or more transmission characteristics 
Typically said one or more transmission characteristics include any one or more of network protocol, date and time stamps size of 
transmission activities (text and image), content type of transmission activities, pattern seen within the content of the transmission and 
any other characteristic that can be employed for predicting classifications. The apparatus (10) can be adapted toclassify user profiles in 
accordance wuh the predicted classification. A knowledge base of predetermined profiles can be included, and the analysing means is 
adapted to predict a classification based on a comparison between the profile of information to be classified and the predetermined profiles 
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Appendix 

# — ^ : 

#Body Text Word Ratio Models 

SubModel Param AllWordCount WordList AilWords 

SubModel Param AllWordCount Context BODY 

# — - 

#Body Text Unique Word Ratio Models 
SubModel Parann AllWordCountUnique WordList AilWords 
SubModel Param AllWordCountUnique Context BODY 
SubModel Param AllWordCountUnique Mode Distinct 

# — _ 

#Meta Text Word Ratio Models 

SubModel Param AilMetaWordCount WordList AilWords 
SubModel Param AilMetaWordCount Context META 

# — 

#Alternate Text Word Ratio Models 

SubModel Param All Alternate WordCount WordList AilWords 
SubModel Param AllAlternate WordCount Context ALTERNATE 

# — 

#lmage models 

SubModel Param LargeGIFPictureCount Dimension 201 x 201 - 999 x 999 
SubModel Param LargeGIFPictureCount ImageType GIF 

SubModel Param ThumbnailGIFPictureCount Dimension 51 x 51 - 200 x 200 

SubModel Param ThumbnailGIFPictureCount ImageType GIF 

SubModel Param IconGIFPictureCount Dimension 5 x 5 - 50 x 50 

SubModel Param IconGIFPictureCount ImageType GIF 

SubModel Param AllGIFPictureCount ImageType GIF 

Model Exp LargeGIFPictureRatio RATIO(LargeGIFPictureCount, 

AllGIFPictureCount) 
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Model Exp ThumbnailGIFPictureRatio RATIO{ThumbnailGIFPictureCount, 
AllGIFPictureCount) 

Model Exp IconGIFPictureRatio RATIOdconGIFPictureCount, 

AllGIFPictureCount) 

# 

SubModet Param LargeJPEGPictureCount Dimension 201 x 201 - 999 x 999 
SubModel Param LargeJPEGPictureCount ImageType JPEG 

SubModel Param ThumbnailJPEGPictureCount Dimension 51 x51 - 200 x 200 

SubModel Param ThumbnailJPEGPictureCount ImageType JPEG 

SubModel Param IconJPEGPictureCount Dimension 5 x 5 - 50 x 50 

SubModel Param IconJPEGPictureCount ImageType JPEG 

SubModel Param AllJPEGPictureCount ImageType JPEG 

Model Exp LargeJPEGPictureRatio RATIO{LargeJPEGPictureCount, 

AtiJPEGPictureCount) 

Model Exp ThumbnailJPEGPictureRatio RATIO(ThumbnailJPEGPictureCount, 
AllJPEGPictureCount) 

Model Exp IconJPEGPictureRatio RATIOdconJPEGPictureCount, 

AllJPEGPictureCount) 

# 

SubModel Param LowDepthGlFPictureCount Depth 2 - 4 

SubModel Param LowDepthGlFPictureCount ImageType GIF 

SubModel Param MediumDepthGIFPictureCount Depth 5-6 

SubModel Param MediumDepthGIFPictureCount ImageType GIF 

SubModel Param HighDepthGIFPictureCount Depth 7-16 

SubModel Param HighDepthGIFPictureCount ImageType GIF 

Model Exp LowDepthGIFPictureRatio RATIO(LowDepthGlFPictureCount, 

AllGIFPictureCount) 

Model Exp MediumDepthGIFPictureRatio 
RATIO{MediumDepthGIFPictureCount, AllGIFPictureCount) 
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Model Exp HighDepthGIFPictureRatio RATIO{HighDepthGIFPictureCount, 

AllCIFPictureCount) 

# 

SubModel Param LowDepthJPEGPictureCount Depth 2 - 7 

SubModel Param LowDepthjPEGPictureCount ImageType JPEG 

SubModel Param MediumDepthJPEGPictureCount Depth 8-15 

SubModel Param MediumDepthJPEGPictureCount ImageType JPEG 

SubModel Param HighDepthJPECPictureCount Depth 16-36 

SubModel Param HighDepthJPEGPictureCount ImageType JPEG 

Model Exp LowDepthJPEGPictureRatio RATIO(LowDepthJPEGPictureCount, 

AllJPECPictureCount) 

Model Exp MediumDepthJPEGPictureRatio 

RATIO(MediumDepthJPEGPictureCount, AllJPEGPictureCount) 

Model Exp HighDepthJPECPictureRatio RATIO(HighDepthJPEGPictureCount, 

AllJPEGPictureCount) 

# . 

#Links Out Models 

SubModel Param AllLinkOutCount IncludeLocal FALSE 
# 

SubModel Param AVSLinkOutCount Classification ADULTVERIFICATION 

SubModel Param AVSLinkOutCount IncludeLocal FALSE 

Model Exp AVSLinkOutRatio RATIO(AVSLinkOutCount, AllLinkOutCount) 

# begin porn.conf 

#Body Text Word Count Models 

SubModel Param PornExtraHardWordCount WordFile 

models/dictionary/porn/pom_words_extrahard.txt 

SubModel Param PornHardWordCount WordFile 

models/dictionary/porn/porn_words hard.txt 
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SubModel Param PornMediumWordCount WordFile 
models/dictionary/porn/porn_words_medium.txt 
SubModel Param PornLiteWordCount WordFile 
5 models/dictionary/porn/porn_words_l ite.txt 

SubModel Param PornExtraLiteWordCount WordFile 

modeis/dictionary/porn/porn_words_extral ite.txt 

# 

#Unique Body Text Word Count Models 

10 

SubModel Param PornExtraHardWordCountUnique WordFile 

models/dictionary/porn/porn_words_extrahard-txt 

SubModel Param PornExtraHardWordCountUnique Mode Distinct 

SubModel Param PornHardWordCountUnique WordFile 

1 5 modeis/dictionary/porn/porn_words_hard.txt 

SubModel Param PornHardWordCountUnique Mode Distinct 

SubModel Param PornMediumWordCountUnique WordFile 

model5/dictionary/porn/porn_word5_medtum-txt 

SubModel Param PornMediumWordCountUnique Mode Distinct 

20 SubModel Param PornLiteWordCountUnique WordFile 
mode!s/dictionary/porn/porn_words_lite.txt 
SubModel Param PornLiteWordCountUnique Mode Distinct 
SubModel Param PornExtraLiteWordCount Unique WordFile 
models/dictionary/porn/porn_words_extralite.txt 

25 SubModel Param PornExtraLiteWordCount Unique Mode Distinct 
# 

#Body Text Word Ratio Models 

Model Exp PornTextWordRatioExtraHard RAT10(PornExtraHardWordCount, 
AllWordCount) 

30 Model Exp PornTextWordRatioHard RATIO(PornHardWordCount, 
AllWordCount) 
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Model Exp PornTextWordRatioMedium RAT10{PornMediumWordCoun£, 
AllWordCount) 

Model Exp PornTextWordRatioLite RATIO(PornLiteWordCount, AllWordCount) 
Model Exp PornTextWordRatioExtraLite RATIO{PornExtraLiteWordCount, 
AllWordCount) 

# — 

#Body Text Unique Word Ratio Models 
Model Exp PornTextWordRatioExtraHard Unique 
RATIO{PornExtraHardWordCountUnique, AllWordCountUnique) 
Model Exp PornTextWordRatioHardUnique 
RATIO(PornHardWordCountUnique, AllWordCountUnique) 

Model Exp PornTextWordRatioMediumUnique 

RATIO(PornMediumWordCountUnique, AllWordCountUnique) 

Model Exp PornTextWordRatioLiteUnique RATIO(PornLiteWordCountUnique, 

AllWordCountUnique) 

Model Exp PornTextWordRatioExtraLfteUnique 
RATtOCPornExtraLiteWordCountUnique, AllWordCountUnique) 
4i 

#Domain Word Count Models 

SubModel Param PornExtraHardDomainWordCount Context DOMAIN-NAME 
SubModel Param PornExtraHardDomainWordCount WordFile 
models/dictionary/porn/porn_words_extrahard.txt 

SubModel Param PornHardDomainWordCount Context DOMAIN-NAME 
SubModel Param PornHardDomainWordCount WordFile 
models/dictionary/porn/porn_words_hard.txt 

SubModel Param PornMediumDomainWordCount Context DOMAIN-NAME 
SubModel Param PornMediumDomainWordCount WordFile 
models/dictionary/porn/porn_words_medium.txt 

SubModel Param PornLiteDomainWordCount Context DOMAIN-NAME 
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SubMode! Param PornLiteDomainWordCount WordFile 
models/dictionary/porn/porn_words_lite.txt 

SubModel Param PornExtraLiteDomainWordCount Context DOMAIN-NAME 
SubModel Param PornExtraLiteDomainWordCount WordFile 
models/dictionary/porn/porn_words_extralite.txt 

# ~ - 

#Meta Text Word Count Models 

SubModel Param PornExtraHardMetaWordCount Context META 
SubModel Param PornExtraHardMetaWordCount WordFile 
models/dictionary/porn/porn_wordsextrahard.txt 
SubModel Param PornHardMetaWordCount Context META 
SubModel Param PornHardMetaWordCount WordFile 
models/dictionary/porn/porn_words_hard.txt 

SubModel Param PornMediumMetaWordCount Context META 

SubModel Param PornMediumMetaWordCount WordFile 

models/dictionary/porn/porn_word5_medium.txt 

SubModel Param PornLiteMetaWordCount Context META 

SubModel Param PornLiteMetaWordCount WordFile 

model5/dictionary/porn/porn_words__lite.txt 

SubModel Param PornExtraLiteMetaWordCount Context META 

SubModel Param PornExtraLiteMetaWordCount WordFile 

models/dictionary/porn/porn_words_extralite.txt 

# — 

#Meta Text Word Ratio Models 

Model Exp PornMetaWordRatioExtraHard 

RATIO{PornExtraHardMetaWordCount, AllMetaWordCount) 

Mode! Exp PornMetaWordRatioHard RAT10(PomHardMetaWordCount, 

AllMetaWordCount) 
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Model Exp PornMetaWordRatioMedium RATIO(PornMediumMetaWordCount, 
AllMetaWordCount) 

Model Exp PornMetaWordRatioLite RATIO(PornLiteMetaWordCount, 
AllMetaWordCount) 

Model Exp PornMetaWordRatioExtraLite RATIO(PomExtraLiteMetaWordCount, 
AllMetaWordCount) 

# 

#Alternate Text Word Count Models 

SubModel Parann PornExtraHardAlternateWordCount Context ALTERNATE 

SubModel Param PornExtraHardAlternateWordCount WordFile 

models/dictionary/porn/porn_words_extrahard.txt 

SubModel Parann PornHardAlternateWordCount Context ALTERNATE 

SubModel Param PornHardAlternateWordCount WordFile 

models/dictionary/porn/porn_word5_hard.txt 

SubModel Param PornMediumAlternateWordCount Context ALTERNATE 

SubModel Param PornMediumAlternateWordCount WordFile 

modeis/dictionary/porn/porn_words_medium.txt 

SubModel Param PornLiteAlternateWordCount Context ALTERNATE 

SubModel Param PornLiteAlternateWordCount WordFile 

models/dictionary/porn/porn_words_lite.txt 

SubModel Param PornExtraLiteAlternateWordCount Context ALTERNATE 

SubModel Param PornExtraLiteAlternateWordCount WordFile 

models/dictionary/porn/porn_words__extralite.txt 

# — 

#A!ternate Text Word Ratio Models 

Model Exp PornAlternateWordRatioExtraHard 

RATIO(PornExtraHardAlternateWordCount, AllAlternateWordCount) 

Model Exp PornAlternateWordRatioHard RATIO(PornHardAlternateWordCount, 

AllAiternateWordCount) 
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wo 00/52598 



PCT/AUOO/00158 



Model Exp PornAlternateWordRatioMedium 

RATIO(PornMediumAlternateWordCount, AllAlternateWordCount) 
Mode! Exp PornAiternateWordRatioLite RATIO(PornLiteAlternateWordCount, 
5 Al [Alternate WordCount) 

Model Exp PornAlternateWordRattoExtraLite 

RATIO(PornExtraLiteAlternateWordCount, AllAlternateWordCount) 

# . 

#Links Out Models 
10 SubModel Parann PornLinkOutCount Classification PORN 
SubModel Param PornLinkOutCount IncludeLocal FALSE 
Model Exp PornLinkOutRatio RATIO(PornLinkOutCount, AIILinkOutCount) 
# 

#Logistic Models 
1 5 Model Exp PornLRConstant -3.9869 
# 

Model Exp PornLRCoefficientPornTextWordRatioExtraHard 39.7450 

Model Exp PornLRCoefficientPornTextWordRatioHard 355.0550 
20 Model Exp PornLRCoefficientPornTextWordRatioMedium -1 36.436 
Model Exp PornLRCoefficientPornTextWordRatioLite -63.2565 
Model Exp PornLRCoefficientPornTextWordRatioExtraLite 33.9054 
# 

Model Exp PornLRCoefficientPornTextWordRatioExtraHardUnique 1 11. 4752 
25 Mode! Exp PornLRCoefficientPornTextWordRatioHard Unique -72.7005 

Model Exp PornLRCoefficientPornTextWordRatioMediumUnique 264.1902 
Mode! Exp PornLRCoefficientPornTextVVordRatioLiteUnique 125.0743 
Model Exp PornLRCoefficientPornTextWordRatioExtraLiteUnique -16.6895 
# 

30 Model Exp PornLRCoefficientPornExtraHardDomainWordCount 0.2598 
Model Exp PornLRCoefficientPornHardDomainWordCount 2.1344 
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Model Exp PornLRCoefficientPornMediumDornainWordCouni: 0 
Model Exp PornLRCoefficientPornLiteDomainWordCount 0.0610 
Model Exp PornLRCoefficientPornExtraLiteDomainWordCount 0 
# 

Model Exp PornLRCoefficientPornMetaWordRatioExtraHard 0 
Model Exp PornLRCoefficientPornMetaWordRatioHard 0 
Model Exp PornLRCoefficientPornMetaWordRatioMedium 0 
Model Exp PornLRCoefficientPornMetaWordRatioLite 0 
Model Exp PornLRCoefficientPornMetaWordRatioExtraLite 0 
# 

Model Exp PornLRCoefficientPornAlternateWordRatioExtraHard 16.1972 

Model Exp PornLRCoefficientPornAlternateWordRatioHard 0 

Model Exp PornLRCoefficientPornAlternateWordRatioMediunn 26.4186 

Mode! Exp PornLRCoefficientPornAlternateWordRatioLite 0 

Model Exp PornLRCoefficientPornAlternateWordRatioExtraLite 14.1615 

# 

Model Exp PornLRCoefficientAllGIFPictureCount 0 
Mode! Exp PornLRCoefficientLargeGIFPictureCount 0 



Model Exp PornLRCoefficientlconGIFPictureCount 0 
Model Exp PornLRCoefficientThumbnailGIFPictureCount 0 
Model Exp PornLRCoefficientLargeGIFPictureRatio 0 
Mode! Exp PornLRCoefficientlconGIFPictureRatio 0 
Model Exp PornLRCoefficientThumbnailGIFPictureRatio 0 
Model Exp PornLRCoefficientHighDepthGIFPictureCount 0 
Model Exp PornLRCoefficientMediumDepthGIFPictureCount 0 
Model Exp PornLRCoefficientLowDepthGIFPictureCount 0 
Model Exp PornLRCoefficientHighDepthGIFPictureRatio 0 
Model Exp PornLRCoefficientMediumDepthGIFPictureRatio 0 
Mode! Exp PornLRCoefficientLowDepthGIFPictureRatio 0 
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Model Exp PornLRCoefficientAIIJPEGPictureCount 0 
Mode! Exp PornLRCoefficientLargeJPEGPictureCount 0 
5 Model Exp PornLRCoefficientlconJPEGPictureCount 0 

Model Exp PornLRCoeffjcientThumbnailJPEGPictureCount 0 
Model Exp PornLRCoefficientLargeJPEGPictureRatio 0 
Model Exp PornLRCoefftcientlconJPEGPictureRatio 0 
Model Exp PornLRCoefficientThumbnailJPEGPictureRatio 0 

10 Model Exp PornLRCoefficientHighDepthJPEGPictureCount 0 

Model Exp PornLRCoefficientMediumDepthJPEGPictureCount 0 
Model Exp PornLRCoefficientLowDepthJPEGPictureCount 0 
Model Exp PornLRCoefficientHighDepthJPEGPictureRatio 0 
Model Exp PornLRCoefficientMediumDepthJPEGPictureRatio 0 

15 Mode! Exp Porn LRCoefficientLowDepthJ PEG PictureRatio 0 
# 

Model Exp PornLRCoefficientPornLinkOutRatio 4.6958 
Model Exp PornLRCoefficientAVSLinkOutCount 0.3327 
Model Exp PornLRCoefficientAVSLinkOutRatio 3.6786 

20 # 

Model Exp PornLRLogOdds SUM(PornLRConstant, \ 

PRODUCKPomLRCoefficientPornTextWordRatioExtraHard, 
PornTextWordRatioExtraHard), \ 

PRODUCT(PornLRCoefficientPornTextWordRatioHard, 
25 PornTextWordRatioHard), \ 

PRODUCKPornLRCoefficientPornTextWordRatioMedium, 
PornTextWordRatioMedium), \ 

PRODUCT(PornLRCoefficientPornTextWordRat!oLite, 
PornTextWordRatioLite), \ 
30 PRODUCKPornLRCoefficientPornTextWordRatioExtraLite, 
PornTextWordRatioExtraLite), \ 
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PRODUCKPornLRCoefficientPornTextWordRatioExtraHardUnique, 
PornTextWo rd Rat ioExt ra H ard U n i q ue), \ 

PRODUCT(PornLRCoefficientPornTextWordRatioHardUnique, 
PornTextWordRatioHardUnique), \ 

PRODUCT{PornLRCoefficientPornTextWordRatioMediumUnique, 
Po rnText Word Rati oMed i u m U n i q ue), \ 

PRODUCT(PornLRCoefficientPomTextWordRatioLiteUniquG, 
PornTextWordRatioLiteUnique), \ 

PRODUCKPornLRCoefficientPornTextWordRatioExtraLiteUnique, 
PornTextWordRatioExtraLiteUnique), \ 

PRODUCKPornLRCoefficientPornExtraHardDomainWordCount, 
PornExtraHardDomainWordCount), \ 

PRODUCT(PornLRCoefficientPornHardDomainWordCount, 
PornHardDomainWordCount), \ 

PRODUCT{PornLRCoefficientPornMediumDomainWordCount, 
PornMedium Domain WordCount)^ \ 

PRODUCKPornLRCoefficientPornLiteDomainWordCount, 
PornLiteDomainWordCount), \ 

PRODUCT(PornLRCoefficientPornExtraLiteDomainWordCount, 
PornExtraLiteDomainWordCount), \ 

PRODUCKPornLRCoefficientPornMetaWordRatioExtraHard, 
PornMetaWordRatioExtraHard), \ 

PRODUCKPornLRCoefficientPornMetaWordRatioHard, 
PornMetaWordRatioHard), \ 

PRODUCKPornLRCoefficientPornMetaWordRatioMedium, 
PornMetaWordRatioMedium), \ 

PRODUCT(PornLRCoefficientPornMetaWordRatioLite, 
PornMetaWordRatioLite), \ 

PRODUCKPomLRCoefficientPornMetaWordRatioExtraLite, 
PornMetaWordRatioExtraLite), \ 
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PRODUCKPornLRCoefficientPornAlternateWordRatioExtraHard, 
PornAlternateWordRatioExtraHard)/ \ 

PRODUCKPornLRCoefficientPornAlternateWordRatioHard, 
PornAlternateWordRatioHard), \ 

PRODUCT{PornLRCoefficientPornAlternateWordRatioMedium, 
PornAlternateWordRatioMedium), \ 

PRODUCKPornLRCoefficientPornAlternateWordRatioLite, 
PornAhernateWordRatioLite), \ 

PRODUCT{PornLRCoefficientPornAlternateWordRatioExtraLite, 
Po rn A I te rn ate Word Rat ioExtra L i te), \ 

PRODUCKPornLRCoefficientAllGIFPictureCount, AllGIFPictureCount), \ 

PRODUCT(PornLRCoefficientLargeGIFPictureCount, 
LargeCIFPictureCount), \ 

PRODUCT(PornLRCoefficientlconGIFPictureCount, 
IconGIFPictureCount), \ 

PRODUCT{PornLRCoefficientThumbnat!CIFPictureCount, 
ThumbnailGtFPictureCount), \ 

PRODUCT(PornLRCoefficientLargeGlFPictureRatio, 
LargeGiFPiaureRatio), \ 

PRODUCKPornLRCoefficientlconGIFPictureRatio, IconGIFPictureRatio), \ 

PRODUCT(PornLRCoefficientThumbnailGIFPictureRatio, 
ThumbnailGIFPictureRatio), \ 

PRODUCKPornLRCoefficientHighDepthGiFPictureCount, 
HighDepthGlFPictureCount), \ 

PRODUCT(PornLRCoefficientMediumDepthGlFPictureCount, 
MediumDepthGIFPictureCount), \ 

PRODUCT{PornLRCoefficientLowDepthGlFPictureCount, 
LowDepthGlFPictureCount), \ 

PRODUCT{PornLRCoefficientHighDepthGIFPictureRatio, 
HighDepthGlFPictureRatio), \ 
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PRODUCKPornLRCoefficientMediumDepthCIFPictureRatio, 

MediumDepthGlFPictureRatio), \ 

PRODUCT(PornLRCoefficientLowDepthGIFPictureRatio, 

5 LowDepthGIFPictureRatio), \ 

PRODUCKPornLRCoefficientAllJPEGPictureCount, AllJPEGPictureCount), 

\ 

PRODUCT{PornLRCoefficientLargejPEGPictureCount, 
LargeJPEGPictureCount}, \ 
1 0 PRODUCKPornLRCoefficientlconjPEGPictureCount, 
iconJPEGPictureCount), \ 

PRODUCT(PornLRCoefficientThumbnailJPEGPictureCount, 
ThumbnailJPEGPictureCount), \ 

PRODUCT(PornLRCoeffictentLargeJPECPictureRatio, 

1 5 LargejPECPictureRatio), \ 

PRODUCT(PornLRCoefficientlconJPECPictureRatio, 

IconJPEGPictureRatio), \ 

PRODUCTCPornLRCoefficientThumbnaiijPEGPictureRatio, 

ThumbnailJPEGPictureRatio), \ 
20 PRODUCT(PornLRCoefficientHighDepthJPEGPictureCount, 
HighDepthJPEGPictureCount), \ 

PRODUCKPornLRCoefficientMediumDepthjPEGPictureCount, 
MediumDepthJPEGPictureCount), \ 

PRODUCT(PornLRCoefficientLowDepthJPEGPictureCount, 

25 LowDepthJPEGPictureCount), \ 

PRODUCT(PornLRCoefficientHighDepthJPEGPictureRatio, 
HighDepthJPEGPictureRatio), \ 

PRODUCT(PornLRCoefficientMediumDepthJPEGPictureRatio, 
MediumDepthjPEGPictureRatio), \ 
30 PRODUCT(PornLRCoefficientLowDepthJP6GPictureRatio, 
LowDepthJPEGPictureRatio), \ 
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PRODUCTCPornLRCoefficientPornLinkOutRatio, PornLinkOutRatio), \ 
PRODUCT(PornLRCoefficientAVSLinkOutCount, AVSLinkOutCount), \ 
PRODUCKPornLRCoefficientAVSLinkOutRatio, AVSLinkOutRatio)) 

5 

#Probability Analysers 

ProbabilityAnalyser Param PornAltMetaWordCountProbability Classification 
PORN 

ProbabilityAnalyser Exp PornAltMetaWordCountProbability \ 
10 SUM(PornExtraHardMetaWordCount, PornHardMetaWordCount, \ 

PRODUCT{0.5,PornMediumMetaWordCount), \ 
PornExtraHardAlternateWordCount, 
Porn Hard Alternate WordCount, \ 

PRODUCT(0.5,PornMediumAlternateWordCount)) 
15 ProbabilityAnalyser Param PornMetaWordRatioProbability Classification PORN 
ProbabilityAnalyser Exp PornMetaWordRatioProbability \ 

PRODUCTd 00, SUM(PornMetaWordRatioExtraHard, \ 

PornMetaWordRatioHard, PornMetaWordRatioMedium)) 
ProbabilityAnalyser Param PornLRProbability Classification PORN 
20 ProbabilityAnalyser Exp PornLRProbability PRODUCTdOO, 
RAT!0(l,SUM(l,EXP(MiNUS(PornLRLogOdds))))) 
# 
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Site 1 Objects ■ Content Type "Search Engine" 





Source ' 


Destination 








Object 


IP Address 


Port 


IP MdTBZS 


Port 


Type 
is: 


Size 


TImestamn 


•| 


202.139.16.45 


63450 


204.71 .200.72 


80 


GET 


3€ 


14:53:17 


p 


504 71 200 72 


80 


202 139 16 45 


63450 


text/htm i 


942^ 


» *T.O\J, 1 / 


O 

o 


202 1 39 1 6 45 


63450 


204 71 200 72 


80 


GPT 


7Q 


I*f .Do. 1 y 


A 
*\ 






204 71 200 72 


80 


GET 


AP^ 




c 
O 


202.139,16.45 


63450 


204.71.200.72 


80 




ou 




c 


204.71.200.72 


80 


202.139.16.45 


63450 


Image/gif 


2637 




7 


204.71.200.72 


80 


202.139.16.45 


63450 


image/gif 


4672 


14:53:20 


8 


204.71.200.72 


80 


202.139.16.45 


63450 


image/gif 


357 


14:53:20 


9 


202,139.16.45 


63450 


204.71.200.72 


80 


GET 


56 


14:53:27 


10 


204.71.200.72 


SO 


202.139.16.45 


63450 


text/htmi 


11193 


14:53:28 


11 


202.139.16.45 


63450 


204.71,200.72 


80 


GET 


59 


14:53:29 


12 


204.71.200.72 


80 


202.139.16.45 


63450 


image/gif 


11522 


14:53:29 


13 


202.139.16.45 


63450 




80 


GET 


67 


14:53:30 


14; 


202.139.16.45 


63450 


204.71 .200.72 


80 


GET 


67 


14:53:30 


15 


202.139.16.45 


63450 


204.71,200,72 


80 


GET 


59 


14:53:30 


16 


204.71.200.72 


80 


202.139,16.45 


63450 


image/gif 


1398 


14:53:30 


17 


202.139.16.45 


63450 


204.71.200.72 


SO 


GET 


76 


14:53:30 


18 


204.71.200.72 


80 


202-139.16.45 


63450 


rmage/gif 


1728 


14:53:30 


19 


202.139.16.45 


63450 


204.71.200.72 


80 


GET 


69 


14:53:30 


20 


204.71 .200.72 


80 


202.139.16.45 


63450 


mage/gif 


962 


14:53:30 


21 


204.71.200-72 


80 


202.139.16.45 


63450 J 


mage/gif 


946 


14:53:31 


22 


204.71.200.72 [ 


80 


202.139.16,45 


63450 [image/gif : 


1716 


14:53:31 | 
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Site 2 Objects - Content Type 


"News" 






Ob\ect 


Source 


Destination 






■ 1 ■■ — - 

Timestam p 


iP Address 


Port 


IP Address ' 


Port 


Xype 




1 202-139.16.45 


6345C 


) 165.69.1.187 


8C 


s l:3c: f 




i i4.D4:04 




I 165mi.187 


ec 


) 202.139.16.45 


6345C 


) text/niml 


231/: 


I 14:54:04 




] 202,139.18,45 


6345C 


) 165.69-1.187 


SC 


) GET 


52 


I 14:54:05 


I 


[ 202.139.16.45 


6345C 


) 165.69.1.187 


8C 


) GET 


5e 


i 14:54:05 


c 


i 202.139.16.45 


63450 


165.69.1.187 


8C 


GET 


46 


; 14:54:05 


€ 


202.139.16.45 


63450 


165.69,1.187 


80 


GET 


48 


14:54:05 


7 


202.139.16.45 


63450 


165.69.1.187 


80 


GET 


47 


14:54:05 


8 


202.139.16.45 


63450 


165.69.1.1 B7 


80 


GET 


47 


14:54:05 


9 


165.69,1.187 


SO 


202.139.16.45 


63450 


text/html 


333 


14:54:05 


10 


165.69.1.187 


80 


202.139.16.45 


53450 


text/html 


56 


14:54:06 


11 


165.69.1.187 


80 


202.139.16.45 


63450 


lextmtml 


2445 


14:54:06 


12 


165-69.1.187 


80 


202.139,16.45 


63450 


text/html 


202 


14:54:06 


13 


165.69.1.187 


80 


202.139.16.45 


63450 


text/htmt 


202 


14:54:06 


14 


165.69.1.187 


80 


202.139-16.45 


63450 


text/html 


56 


14:54:06 


15 


202.139.16.45 


63450 


165.69.1.187 


80 


GET 


71 


14:54:06 


16 


165.69.1.187 


80 


202.139-16.45 


63450 


image/gif 


1229 


14:54:06 


17 


202.139.16.45 


63450 


165.69.1,187 


80 


GET 


64 


14:54:06 


18 


165.69.1.187 


80 


202.139.16.45 


63450 


image/gif 


43 


14:54:06 


19 


202.139.16.45 


63450 


165.69.1.187 


80 


GET 


64 


14:54:06 


20 


202.139.16.45 


63450 


165.69.1.187 


80 


GtlT 


64 


14:54:07 




165.69.1.187 


80 


202.139.16.45 


63450 


irn3g©/gif 




1 4:54:07 




165.63.1.187 


80 


202.139-16.45 


B3450 


im 0 n ci /mf 
u X \ c;/ y 1 1 




1 ^ .O-H- .KJ i 


23 


202.139.16.45 


63450 


1 65.69.1 .187 


80 


GET 


67 


1 4:54:07 


24 


1 bO-by, J .1 o / 


oU 




DO*tDU 


image/g'ff 


2442 


14:54:07 


25 


202.139.16.45 


63450 


165.69.1.187 


80 


GET 


73 


14:54:07 


26 




AO 






image/gif 


1364 


14:54:07 


27 


202.139.16.45 


63450 


165.69.1.187 


80 


GET 


71 


14:54:07 


28 


165.69.1.187 


80 


202.139-16.45 


63450 


image/gif 


8942 


14:54:07 


29 


202.139.16.45 


63450 


165.69-1.187 


80 


GET 


71 


14:54:07 


30 


202.139.16.45 


63450 


165.69.1.187 


80 


GET 


65 


14:54:08 


-31 


165.69.1.187 


80 


202.139.16-45 


63450 


unknown 


10 


14:54:08 


32 


165,69.1.187 


80 


202.139.16.45 


63450 


image/gif 


15550 


14:54:08 


33 


202.139.16.45 


63450 


165.69.1.187 


80 


GET 


77 


14:54:08 


34 


165.69.1.187 


80 


202-139.16.45 


63450 


image/gif 


4732 


14:54:03 


35 


202.139.16.45 


63450 


165.69.1.187 


80 


GET 


70 


14:54:08 


36 


202.139.16.45 


63450 


165.69.1.187 


80 


GET 


70 


14:54:09 


37 


202.139.16.45 


63450 


165.69.1.187 


80 


GET 


68 


14:54:09 


38 


202.139.16.45 


63450 


165.69.1.187 


80 


GET 


68 


14:54:09 


39 


165.69.1-187 


80 


202.139.16,45 


63450 


image/gif 


436 


14:54:09 


40 


165-69.1.187 


80 


202-139-16-45 


63450 


image/gif 


405 


14:54:09 


41 


165.69.1.187 


80 


202.139.16.45 


63450 


image/gif 


436 


14:54:09 


42 


165.69.1.187 


BO 


202.139.15.45 


63450 


image/gif 


405 


14:54:09 
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Site 3 Objects - Content Type "Entertainment" 






Source 


Destination 










Object 


IP Address 


Port 


IP Aoaress 


Port 


Type 


Size 


Tlmestamp 






I 202.139.16.45 


6345C 


) 204.202,129.2, 


3: 8 


D GET 


4 


0 14:55:10 






I 204.202.129.2 


3 SC 


) 202.139.16.45 


63451 


D text/htm i 


5 


3 14:55:11 






3 202.139.16.45 


6345C 


) 204.202.129.2; 


3. 81 


} GET 


3J 


3 14:55:11 






[ 204.202.129.2: 


3 8C 


) 202.139.16.45 


6345{ 


3 text/html 


2576^ 


5 14:55:12 






i 202.139.16.45 


6345C 


f 204.202.129.2: 


I 8C 


) GET 


5^ 


B 14:55:13 




e 


(202.139.16.45 


63450 


204.202.129.2: 


3. 8C 


) GET 


6£ 


5 14:55:13 




7 


202.139.16.45 


63450 


204.202.129.2: 


3 8C 


) GET 


62 


2 14:55:13 




8 


204.202.129.2: 


5 80 


202.139.16.45 


6345C 


) image/jpe< 


3 5277 


r 14:55:14 




9 


204.202.129.2c 


5 80 


202.139.16.45 


6345C 


) image/gi'f 


42 


i 14:55:14 




1 10 


204.202. 129.2G 


i 80 


202.139.16.45 


63450 


image/gif 


1266 


} 14:55:14 




11 


202.139.16.45 


63450 


204.202.129.23 


80 


GET 


63 


14:55:20 




12 


202.139.16.45 


83450 


204.202.129-23 


80 


GET 


58 


[14:55:21 




13 


202.139,16.45 


83450 


204.202.129.23 


80 


GET 


72 


{ 14-5^-Pi 




14 


204.202.129.23 


80 


202.139.16:45 


63450 


imsnp/nif 


1733 




uJ 


15 


204.202.129.23 


80 


202.139.16.45 


63450 


imaop/nif 


5314 




if 


16 


204.202.129.23 


80 


202.139.16.45 


63450 


image/gif 


414 






17 


202.139.16.45 


63450 


204.202.129.23 


80 


GET 


68 




rij 


18 


202.139.16-45 


63450 


204.202.129.23 


80 


GET 


62 






1Q 


204.202.129.23 


80 


202.139-16.45 


63450 


imarjp/nif 


406 






20 


204.202-129.23 


80 


202.139.16.45 


63450 


imsnp/nif 


746 




CI 


21 


202.139.16.45 


63450 


204.202.129.23 


80 


GET 


65 




22 


202,139.16.45 


63450 


204.202.129.23; 


80 


GET 


58 






23 


202.139.16.45 


63450 


204.202.129.2a 


80 


GET 


63 


14:55:23 




24 


202.139.16,45 


63450 


204.202.129.23 


80 


GET 


62 


14:55:23 




25 


204,202-129.23 


80 


202.139.18.45 


53450 


image/gif 


1655 


14:55:23 




26 


204.202.129.23 


80 


202.139.16.45 


63450 


image/gif 


35 


14:55:24 




. 27 


204.202.129.23 


80 


202.139.16.45 


63450 


image/gif 


906 


14:55:24 




28 


204.202.129.23 


80 


202-139.16.45 


63450 


image/gif 


447 


14:55:24 




29 


202-139.16.45 


63450 


204.202.129.23 


80 


GET 


67 


14:55:24 




30 


202.139.16.45 


63450 


204.202.129.23; 


80 


GET 


58 


14:55:24 




31 


202.139.16.45 


53450 


204.202.129.23, 


80 


GET 


62 


14:55:24 




32 


204.202.129.23 


80 . 


202.139.16.45 


63450 


image/jpec 


7861 


14:55:24 




33 


204.202.129.23 


SO i 


202.139.16.45 


63450 


image/gif 


391 


14:55:25 




34 


204.202.129.23 


80 i 


TOO -t opi -i rr^ ATI 

202.139.1 5.45 


63450 


fmage/gif 


641 


14:55:25 




35 








oU 


GET 


57 


14:55:25 




36 


204.202-129.23 


80 ; 


202.139.16.45 


63450 


mage/gif 


377 


14:55:25 




o / 


202.139.16.45 


63450 ; 


204.202.129.23: 


80 


Lad 1 








38 


202.139.16.45 


63450 : 


204.202.129.23; 


80 


GET 


78 


14:55:25 




39 


202.139.16.45 


63450 : 


204.202,129.23: 


80 < 


GET 


74 


14:55:26 




40 


202.139.16.45 


63450 1 


204.202.129.23 


80 ( 


3ET 


67 


14:55:26 




41 


204.202.129.23 


SO i 


202-139.16.45 


63450 t 


mage/gif 


403 


14:55:25 




42 


204.202.129.23 


80 1 


202.139.16.45 


63450 i 


mage.'^gif 


1796 


14:55:26 




43 


204.202.129.23 


80 ^ 


^02.139.16.45 


63450 i 


mage/gif 


6845 ' 


14:55:27 




44 ; 


202,139.16.45 


63450 ^ 


?04.202. 129.23 


SO ( 


3ET 


56 ■ 


f4:55:27 




45 : 


204.202.129.23 


80 : 


?02.139. 16.45 


63450 i 


mage/jpec 


17796 ■ 


14:55:27 




46 ; 


202.139.16.45 


53450 1 


104.202.129.23: 


80 < 


3ET 


56 - 


14:55:27 




47 1 


204.202.129.23 


80 ^ 


?02.139. 16.45 


63450 f 


mage/gif 


49 ' 


14:55:27 




4a : 


204.202.129.23 


80 2 


?02.139.16.45 


63450 i 


mage/gif 


44 - 


4:55:27 
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She 4 Objects - Content Type "Computer Hardware" 



Object 


Source 


DesWnaWon 




Size 


T\mes\amp 


IP Address 


Pon 


\P Address 


Port 


Type 


1 


202.139,16-45 


6345C 


) 209.143.240.6 


8{ 


} GET 


3^ 


X 14:55:49 




\ 209.143.240.6 


8C 


) 202.139.16.45 


6345{ 


D text/html 


164f 


3 14:55:50 


U G 


1202.139.16.45 


6345C 


) 209.143.240.6 


8C 


) GET 


Al 


^ 14:55:51 


Q A 


209.143.240,6 


8C 


) 202.139.16.45 


6345C 


} text/html 


5€ 


> 14:55:51 


% 5 


202.139,16.45 


6345C 


) 209.143-240.6 


8C 


) GET 


64 


^ 14:55:53 


IS 6 


202.139.16.45 


6345G 


209.143.240.6 


SC 


) GET 


62 


t 14*55*53 


a. 7 




63450 


209 143 240 6 


fir 


GET 


63 


14*55*53 


.1 8 


PQ9 1 43 240 6 


80 


202.139.16.45 


63450 


image/gif 


290 


1 4:5S:53 


J 9 


PnP 139 1 6 45 


63450 


209 143 240 6 


BO 


GET 


64 


14:55:54 


10 


PDQ 143 240 6 


80 


202 139 16 45 


63450 


image/gif 


403 


14:55:54 


11 


209.143.240.6 


80 


202.139.16.45 


63450 


imaae/oif 


381 


14:55:54 


r 12 


202.139.16.45 


63450 


209.143.240.6 


80 


GET 


63 


14:55:54 




202.139.16.45 


63450 


209.143.240.6 


80 


GET 


65 


14:55:54 


? 14 


209.143.240.6 


80 


202.139.16.45 


63450 


image/grf 


348 


14:55:54 


1 15 


202.139,16.45 


63450 


209.143.240.6 


80 


GET 


65 


14:55:54 


16 


202.139.16.45 


63450 


209.143.240.6 


80 


GET 


65 


14:55:54 


^ 17 


209.143-240.6 


80 


202.139.16.45 


S3A5Q 


image/gif 


354 


14:55:54 


18 


209.143-240.6 


80 


202.139.16.45 


63450 


tmage/gjf 


600 


14:55:54 


19 


202.139-16.45 


53450 


209.143.240.6 


80 


GET 


65 


14:55:55 


20 


202.139.16.45 


S3450 


209.143-240.6 


80 


GET 


54 


14:55:55 


21 


209.143.240.6 


80 


202.139.16.45 


63450 


rmage/gif 


490 


14:55:55 


22 


202.139.1 6.45 


63450 


209-143.240.6 


80 


GET 


53 


14:55:55 




ono i^tQ Ovin 






oohdU 


image/gn 


o /I 


1 4:55:55 


24 


209-143.240.6 


80 


202.139.16.45 


63450 


It tid^c/Ljtl 


'^9P 


t ^.30 .-DD 


25 


20y. l4o.ii4U.D 


OU 


^U^.139.1 D.45 


63450 


image/gif 


571 


14:55:55 




202.139.16.45 


63450 


209.143.240,6 


80 




oz> 




27 


209.143.240.6 


80 


202.139.16.45 


63450 


image/gif 


363 


14:55:55 


28 


202.139.16.45 


63450 


209.143.240.6 


80 


GET 


63 


14:55:55 


29 


202.139.16.45 


83450 


209.143.240.6 


80 


GET 


63 


14:55:55 


30 


202.139.16.45 


63450 


209.143-240.6 


80 


GET 


62 


14:55:55 


31 


209.143.240.6 


80 


202.139.16.45 


63450 


mage/gif 


241 


14:55:56 


32 


202.139.16-45 


63450 


209.143.240.6 


80 


GET 


59 


14:55:56 


33 


209.143.240.6 


80 . 


202.139.16.45 


63450 


mage/gif 


488 


14:55:55 


34 


209.143.240.6 


80 . 


202.139.15.45 


63450 


mage/gif 


463 


14:55:55 


35 


202.139.16.45 


63450 ; 


209.143-240-6 


80 


GET 


59 


14:55:56 


36 . 


209.143.240.6 


80 : 


202.139.16.45 


63450 


mage/gif 


714 


14:55:56 


37 


202,139.16.45 


63450 i 


209.143.240.6 


80 ^ 


GET 


53 


14:55:56 


38 


209.143.240.6 


80 ; 


202.139.16.45 ; 


63450 i 


mage/gif 


35 


14:55:56 


39 . 


202.139.16.45 


63450 ; 


209.143.240.6 


80 ( 


SET . 


53 


14:55:56 


40 


209.143.240.6 


80 ^ 


202.139.16.45 


63450 i 


mage/gif 


1188 


14:55:56 


41 . 


202.139.16.45 


63450 i 


209.143-240.6 


80 i 


SET 


54 


14:55:56 


42 ; 


209.143.240.6 


80 ^ 


202.139.16.45 


53450 t 


mage/gif 


327 


14:55:57 
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Site 5 Objects - 


Content Type "Computer Software" 






Source 


Destination 








Obi'ect 


IP Address 


Port 


ir Moaress 


r on 


- 

Type 


oize 


Trmestamp 


1 


202.1 da. ib.4a 




^£07 .45. 1 o U . 1 4i; 


on 


ot 1 


4L 


14:57:46 


2 


207.46.1 30. 14S 


80 


202.139.16.45 


63450 


text/ntml 




14:57:47 


3 


202.139.16.45 


63450 


207.46.130.149 


80 


GET 


66 


14:57:48 


A 


207.46.130.149 


80 


202.139.16.45 


63450 


application 


18768 


14:57:49 




202.139.16.45 


63450 


207.46.130.149 


80 


GET 


71 




D 


207,46.130.149 


80 


202.139.16.45 


63450 


application 


2941 


r H.u / .OU 


-7 
/ 


202.139.16.45 


63450 


207.46.130.149 


80 


GET 


70 




o 


207.46.130.149 


80 


202,139.16.45 


63450 


appiicatfon 


2812 


I 'f.O/ .O 1 


9 


202.139,16.45 


63450 


207.46.130.149 


80 


GET 


75 


14.57:51 


10 


202.139.16.45 


63450 


207.46.130.149 


80 


GET 


72 


14:57:51 


1 1 


202.139.16.45 


63450 


207.46.130.149 


80 


GET 


80 


14:57:51 
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